Intelligent Paper Notes

Artificial Intelligence Security Competition (AISC)

Yinpeng Dong, Peng Chen, Senyou Deng, Lianji L, Yi Sun, Hanyu Zhao, Jiaxing Li, Yunteng Tan, Xinyu Liu, Yangyi Dong

Categories: Artificial Intelligence | Computer Vision | Machine Learning

2022-12-07

The security of artificial intelligence (AI) is an important research area towards safe, reliable, and trustworthy AI systems. To accelerate the research on AI security, the Artificial Intelligence Security Competition (AISC) was organized by the Zhongguancun Laboratory, China Industrial Control Systems Cyber Emergency Response Team, Institute for Artificial Intelligence, Tsinghua University, and RealAI as part of the Zhongguancun International Frontier Technology Innovation Competition (https://www.zgc-aisc.com/en). The competition consists of three tracks, including Deepfake Security Competition, Autonomous Driving Security Competition, and Face Recognition Security Competition. This report will introduce the competition rules of these three tracks and the solutions of top-ranking teams in each track.

A Waste Copper Granules Rating System Based on Machine Vision

Kaikai Zhao, Yajie Cui, Zhaoxiang Liu, Shiguo Lian

Categories: Computer Vision

2022-07-11

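In the field of waste copper granule recycling, engineers must be able to identify the various impurities in waste copper granules and estimate their mass proportions before rating. This manual rating method is costly and lacks objectivity and comprehensiveness. To tackle this problem, we propose a waste copper granules rating system based on machine vision and deep learning. We first formulate the rating task as a 2D image recognition and purity regression task. We then design a two-stage convolutional rating network to compute the mass purity and rating level of waste copper granules. Our rating network includes a segmentation network and a purity regression network, which compute the semantic segmentation heatmaps and the purity results of the waste copper granules, respectively. After training the rating network on the augmented datasets, experiments on real waste copper granules demonstrate the effectiveness and superiority of the proposed network. Specifically, our system outperforms the manual method in terms of accuracy, effectiveness, robustness, and objectivity.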

DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points

Zhengfei Kuang, Jiaman Li, Mingming He, Tong Wang, Yajie Zhao

Categories: Computer Vision

2021-12-13

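Establishing dense correspondence between two images is a fundamental computer vision problem, typically tackled by matching local feature descriptors. However, without global awareness, such local features are often insufficient for disambiguating similar regions. Moreover, computing the pairwise feature correlation across images is both computationally expensive and memory-intensive. To make the local features aware of the global context and improve their matching accuracy, we introduce DenseGAP, a new solution for efficient dense correspondence learning with a graph-structured neural network conditioned on anchor points. Specifically, we first propose a graph structure that uses anchor points to provide a sparse but reliable prior on inter- and intra-image context, and propagates it to all image points via directed edges. We also design a graph-structured network that broadcasts multi-level contexts through light-weighted message-passing layers and generates high-resolution feature maps at low memory cost. Finally, based on the predicted feature maps, we introduce a coarse-to-fine framework for accurate correspondence prediction using cycle consistency. Our feature descriptors capture both local and global information, thus enabling a continuous feature field for querying arbitrary points at high resolution. Through comprehensive ablation experiments and evaluations on large-scale indoor and outdoor datasets, we demonstrate that our method advances the state of the art in correspondence learning on most benchmarks.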
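As a rough illustration of the cycle-consistency idea mentioned in the abstract, the sketch below keeps a match only when the nearest neighbour from image A to image B maps back to the same point in A. This is a generic mutual-nearest-neighbour check, not DenseGAP's coarse-to-fine procedure; the function name `cycle_consistent_matches` and the use of cosine similarity are assumptions made for the example. It also builds the full pairwise similarity matrix, which is exactly the cost the abstract says becomes prohibitive at scale.

```python
import numpy as np

def cycle_consistent_matches(desc_a, desc_b):
    """Return (i, j) pairs where A[i] -> B[j] and B[j] -> A[i] agree.

    desc_a: (n, d) descriptors from image A (hypothetical input)
    desc_b: (m, d) descriptors from image B (hypothetical input)
    """
    a = np.asarray(desc_a, dtype=float)
    b = np.asarray(desc_b, dtype=float)
    # L2-normalise rows so the dot product is cosine similarity.
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    sim = a @ b.T                      # (n, m) pairwise similarities
    a_to_b = sim.argmax(axis=1)        # best B index for every A point
    b_to_a = sim.argmax(axis=0)        # best A index for every B point
    idx = np.arange(a.shape[0])
    keep = b_to_a[a_to_b] == idx       # the forward-backward cycle closes
    return [(int(i), int(a_to_b[i])) for i in idx[keep]]

if __name__ == "__main__":
    A = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]]
    B = [[0.0, 1.0], [1.0, 0.0]]
    print(cycle_consistent_matches(A, B))
```

In the toy input, the third descriptor of A is close to the second descriptor of B, but that point prefers the first descriptor of A, so the match is discarded.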

An Information-Theoretic Approach to Transferability in Task Transfer Learning

Yajie Bao, Yang Li, Shao-Lun Huang, Lin Zhang, Lizhong Zheng, Amir Zamir, Leonidas Guibas

Categories: Machine Learning | Computer Vision

2022-12-20

Task transfer learning is a popular technique in image processing applications that uses pre-trained models to reduce the supervision cost of related tasks. An important question is to determine task transferability, i.e. given a common input domain, estimating to what extent representations learned from a source task can help in learning a target task. Typically, transferability is either measured experimentally or inferred through task relatedness, which is often defined without a clear operational meaning. In this paper, we present a novel metric, H-score, an easily-computable evaluation function that estimates the performance of transferred representations from one task to another in classification problems using statistical and information theoretic principles. Experiments on real image data show that our metric is not only consistent with the empirical transferability measurement, but also useful to practitioners in applications such as source model selection and task transfer curriculum learning.
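The abstract does not spell out the metric. A minimal sketch follows, assuming the commonly cited form of the H-score, H(f) = tr(cov(f(X))^{-1} cov(E[f(X) | Y])), where f(X) are features from the source model and Y are target-task labels. The function name `h_score` and the use of a pseudo-inverse for numerical safety are choices made for this example.

```python
import numpy as np

def h_score(features, labels):
    """H-score of features with respect to discrete labels.

    features: (n, d) array of source-model representations
    labels:   (n,) array of target-task class labels
    """
    f = np.asarray(features, dtype=float)
    y = np.asarray(labels)
    n = f.shape[0]
    f = f - f.mean(axis=0, keepdims=True)     # centre the features
    cov_f = f.T @ f / n                        # feature covariance
    # Replace every row by the mean feature of its class: E[f(X) | Y].
    g = np.zeros_like(f)
    for c in np.unique(y):
        mask = y == c
        g[mask] = f[mask].mean(axis=0)
    cov_g = g.T @ g / n                        # between-class covariance
    # Pseudo-inverse guards against a singular feature covariance.
    return float(np.trace(np.linalg.pinv(cov_f) @ cov_g))

if __name__ == "__main__":
    # A feature that equals the label is maximally informative in 1-D.
    print(h_score([[0.0], [0.0], [1.0], [1.0]], [0, 0, 1, 1]))
    # A feature that is balanced within each class carries no label signal.
    print(h_score([[1.0], [-1.0], [1.0], [-1.0]], [0, 0, 1, 1]))
```

A higher score suggests that the class-conditional means of the features are well separated relative to the overall feature spread, which is the intuition behind using it to rank candidate source models.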

An Empirical Study on Multi-Domain Robust Semantic Segmentation

Yajie Liu, Pu Ge, Qingjie Liu, Shichao Fan, Yunhong Wang

Categories: Computer Vision

2022-12-08

Effectively leveraging the many existing datasets to train a robust, high-performance model is of great significance for many practical applications. However, a model trained on a naive merge of different datasets tends to perform poorly because of annotation conflicts and domain divergence. In this paper, we attempt to train a unified model that is expected to perform well across domains on several popular segmentation datasets. We conduct a detailed analysis of the impact on model generalization from three aspects: data augmentation, training strategies, and model capacity. Based on the analysis, we propose a robust solution that improves model generalization across domains. Our solution ranks 2nd on the RVC 2022 semantic segmentation task, using a dataset only 1/3 the size of the one used by the 1st-place model.

A Data-Centric AI Paradigm Based on Application-Driven Fine-grained Dataset Design

Huan Hu, Yajie Cui, Zhaoxiang Liu, Shiguo Lian

Categories: Computer Vision

2022-09-20

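Deep learning has a wide range of applications in industrial scenarios, but reducing false alarms (FA) remains a major difficulty. Academia usually addresses this challenge by optimizing network architectures or network parameters, while ignoring the essential characteristics of the data in application scenarios, which often leads to increased FA in new scenarios. In this paper, we propose a novel paradigm for the fine-grained design of datasets, driven by industrial applications. We flexibly select positive and negative sample sets according to the essential features of the data and the application requirements, and add the remaining samples to the training set as an uncertainty class. We collect more than 10,000 mask-wearing recognition samples covering various application scenarios as our experimental data. Compared with traditional data design methods, our method achieves better results and effectively reduces FA. We make all contributions available to the research community for broader use. The contributions will be available at https://github.com/huh30/opendatasets.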

Fast Composite Optimization and Statistical Recovery in Federated Learning

Yajie Bao, Michael Crawshaw, Shan Luo, Mingrui Liu

Categories: Machine Learning | Machine Learning (Statistics)

2022-07-17

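As a prevalent distributed learning paradigm, Federated Learning (FL) trains a global model on a massive number of devices with infrequent communication. This paper investigates a class of composite optimization and statistical recovery problems in the FL setting, whose loss function consists of a data-dependent smooth loss and a non-smooth regularizer. Examples include sparse linear regression using Lasso, low-rank matrix recovery using nuclear norm regularization, and so on. In the existing literature, federated composite optimization algorithms are designed only from an optimization perspective, without any statistical guarantees. In addition, they do not consider the (restricted) strong convexity commonly used in statistical recovery problems. We advance the frontiers of this problem from both the optimization and the statistical perspectives. On the optimization front, we propose a new algorithm named Fast Federated Dual Averaging for strongly convex and smooth losses, and establish state-of-the-art iteration and communication complexity in the composite setting. In particular, we prove that it enjoys a fast rate, linear speedup, and reduced communication rounds. On the statistical front, for restricted strongly convex and smooth losses, we design another algorithm, namely Multi-stage Federated Dual Averaging, and prove a high-probability complexity bound with linear speedup up to the optimal statistical precision. Experiments on both synthetic and real data demonstrate that our methods outperform other baselines. To the best of our knowledge, this is the first work to provide fast optimization algorithms and statistical recovery guarantees for composite problems in FL.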
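To make the composite setting concrete, here is a toy sketch of a plain federated dual averaging loop for a smooth loss plus an L1 (Lasso) regularizer. It is not the Fast or Multi-stage variant proposed in the paper. Each client takes local gradient steps on a dual state, the server averages the dual states, and the primal point is recovered by soft-thresholding. The function names, the step size, and the threshold schedule `eta * t * lam` are assumptions chosen for illustration.

```python
import numpy as np

def soft_threshold(z, tau):
    """Proximal operator of tau * ||.||_1, applied element-wise."""
    return np.sign(z) * np.maximum(np.abs(z) - tau, 0.0)

def federated_dual_averaging(client_grads, dim, lam,
                             eta=0.1, rounds=100, local_steps=5):
    """Simplified federated dual averaging for loss + lam * ||w||_1.

    client_grads: list of callables, each returning the gradient of one
                  client's smooth loss at a given w (hypothetical interface)
    """
    z_server = np.zeros(dim)   # dual state: accumulated negative gradients
    t = 0                      # gradient steps taken so far per client
    for _ in range(rounds):
        z_clients = []
        for grad in client_grads:
            z = z_server.copy()
            for k in range(local_steps):
                # Map the dual state to the primal; the threshold grows
                # with the number of accumulated gradient steps.
                w = soft_threshold(z, eta * (t + k) * lam)
                z = z - eta * grad(w)
            z_clients.append(z)
        # The server averages dual states rather than primal iterates.
        z_server = np.mean(z_clients, axis=0)
        t += local_steps
    return soft_threshold(z_server, eta * t * lam)

if __name__ == "__main__":
    # Two clients with losses 0.5 * ||w - a_i||^2 (toy data).
    targets = [np.array([2.0, 0.2]), np.array([4.0, -0.4])]
    grads = [lambda w, a=a: w - a for a in targets]
    print(federated_dual_averaging(grads, dim=2, lam=1.0))
```

For this toy problem the minimizer of the averaged objective is the soft-thresholded mean of the client targets, so the second coordinate is driven exactly to zero. Averaging in the dual space is what lets the sparsity survive aggregation.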

Enhancing Clean Label Backdoor Attack with Two-phase Specific Triggers

Nan Luo, Yuanzhang Li, Yajie Wang, Shangbo Wu, Yu-an Tan, Quanxin Zhang

Categories: Computer Vision

2022-06-10

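Backdoor attacks threaten Deep Neural Networks (DNNs). For stealthiness, researchers have proposed clean-label backdoor attacks, which require the adversary not to alter the labels of the poisoned training dataset. The clean-label setting makes the attack more stealthy because the image-label pairs are correct, but some problems remain: first, traditional methods for poisoning training data are ineffective; second, traditional triggers are not stealthy and are still perceptible. To solve these problems, we propose a two-phase and image-specific trigger generation method to enhance clean-label backdoor attacks. Our method is (1) powerful: our triggers can simultaneously promote both phases of a backdoor attack (i.e., the backdoor implantation and activation phases); and (2) stealthy: our triggers are generated from each image, so they are image-specific rather than fixed. Extensive experiments demonstrate that our approach achieves a high attack success rate (98.98%) with a low poisoning rate (5%), high stealthiness under many evaluation metrics, and resistance to backdoor defense methods.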

A Survey on Unsupervised Visual Industrial Anomaly Detection Algorithms

Yajie Cui, Zhaoxiang Liu, Shiguo Lian

Categories: Computer Vision

2022-04-24

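In line with the development of Industry 4.0, the field of surface defect detection is attracting more and more attention. Improving efficiency and saving labor costs have steadily become matters of great concern in industry, and in recent years deep learning based algorithms have outperformed traditional visual inspection methods. Existing deep learning based algorithms are biased towards supervised learning, which not only requires a huge amount of labeled data and labor but is also inefficient and has certain limitations. In contrast, recent research shows that unsupervised learning has great potential for addressing these disadvantages in visual industrial anomaly detection. In this survey, we summarize the current challenges and provide a detailed overview of recently proposed unsupervised algorithms for visual industrial anomaly detection, covering five categories, whose innovations and frameworks are described in detail. We also provide information on publicly available datasets containing surface image samples. By comparing different classes of methods, we summarize the advantages and disadvantages of anomaly detection algorithms. This survey is expected to help both the research community and industry develop a broader, cross-domain perspective.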

Saliency-Aware Spatio-Temporal Artifact Detection for Compressed Video Quality Assessment

Liqun Lin, Yang Zheng, Weiling Chen, Chengdong Lan, Tiesong Zhao

Categories: Computer Vision

2023-01-03

Compressed videos often exhibit visually annoying artifacts, known as Perceivable Encoding Artifacts (PEAs), which dramatically degrade video visual quality. Subjective and objective measures capable of identifying and quantifying various types of PEAs are critical in improving visual quality. In this paper, we investigate the influence of four spatial PEAs (i.e. blurring, blocking, bleeding, and ringing) and two temporal PEAs (i.e. flickering and floating) on video quality. For spatial artifacts, we propose a visual saliency model with a low computational cost and higher consistency with human visual perception. In terms of temporal artifacts, self-attention based TimeSFormer is improved to detect temporal artifacts. Based on the six types of PEAs, a quality metric called Saliency-Aware Spatio-Temporal Artifacts Measurement (SSTAM) is proposed. Experimental results demonstrate that the proposed method outperforms state-of-the-art metrics. We believe that SSTAM will be beneficial for optimizing video coding techniques.
